Multiview Point Based Cuckoo Search Clustering Algorithm for Privacy Preserving on Multiple Sensitive Attributes with Horizontally Partitioned Data

نویسنده

  • J. Anitha
چکیده

Developing data mining techniques both suitable for databases as well as to maintain the individual privacy becomes the primary aim of research based on privacy preserving data mining (PPDM). The present PDDM clustering methods perform privacy preservation only with single point of view in which every tuple in the data matrix for data holder containing single sensitive attribute is been considered. In case of the clustering approaches based on multi-view point clustering, multiple sensitive attributes in a tuple has to be considered which remains inattentive. Considering these drawbacks, present study aims on multiple sensitive attributes (MSA) for observing new privacy risks and to investigate multi-view point based clustering methods in case of unknown data. Prior to solving that data disambiguation problem using Ramon-Gartner subtree graph kernel (RGSGK), the weight values are assigned for determining the kernel value for disambiguated data. The privacy is then gained from RGSGK for converted data matrix samples followed by creation of secure key for each data holder matrix with the help of Improved Ron Rivest, Adi Shamir and Leonard Adleman (IRSA). The proposed framework based on the multiview point based cuckoo search algorithm clustering is a novel context for tackling the problem of privacy preservation in the multiple sensitive attribute (MSA). This work constitutes distance, similarity and dissimilarity matrix for carrying out the clustering process. Comparison of the experimental result of proposed MVCSA Clustering algorithm with conventional methods has been done in terms of the F Measure, running time, less privacy and utility loss, communication cost for UCI machine learning datasets such as adult dataset and house dataset.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Privacy Preserving Multiview Point Based BAT Clustering Algorithm and Graph Kernel Method for Data Disambiguation on Horizontally Partitioned Data

Data mining has been a popular research area for more than a decade due to its vast spectrum of applications. However, the popularity and wide availability of data mining tools also raised concerns about the privacy of individuals. Thus, the burden of data privacy protection falls on the shoulder of the data holder and data disambiguation problem occurs in the data matrix, anonymized data becom...

متن کامل

Privacy Preserving Data mining with Reduced Communication overhead

In today's world privacy and security are more essential elements when data is shared. A fruitful direction for future data mining research will be the development of techniques that incorporate privacy concerns. Most of the methods use random permutation techniques to mask the data, for preserving the privacy of sensitive data. The approaches for privacy preserving data mining suffer from high...

متن کامل

EM-Based Clustering with Privacy Preserving

The aim of this work is to propose a privacypreserving EM-based clustering algorithm for horizontally partitioned data sets between two parties. To this end, we propose basic protocols based on oblivious polynomial evaluation and prove the secrecy of protocols based on the semi-honest security model and the composition theorem. We have also given an extension of the proposed method to address t...

متن کامل

The Privacy of k-NN Retrieval for Horizontal Partitioned Data -- New Methods and Applications

Recently, privacy issues have become important in clustering analysis, especially when data is horizontally partitioned over several parties. Associative queries are the core retrieval operation for many data mining algorithms, especially clustering and k-NN classification. The algorithms that efficiently support k-NN queries are of special interest. We show how to adapt well-known data structu...

متن کامل

Improved COA with Chaotic Initialization and Intelligent Migration for Data Clustering

A well-known clustering algorithm is K-means. This algorithm, besides advantages such as high speed and ease of employment, suffers from the problem of local optima. In order to overcome this problem, a lot of studies have been done in clustering. This paper presents a hybrid Extended Cuckoo Optimization Algorithm (ECOA) and K-means (K), which is called ECOA-K. The COA algorithm has advantages ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015